Relative Variational Intrinsic Control
نویسندگان
چکیده
In the absence of external rewards, agents can still learn useful behaviors by identifying and mastering a set diverse skills within their environment. Existing skill learning methods use mutual information objectives to incentivize each be distinguishable from rest. However, if care is not taken constrain ways in which are diverse, trivially sets arise. To ensure diversity, we propose novel objective, Relative Variational Intrinsic Control (RVIC), incentivizes that how they change agent's relationship its The resulting tiles space affordances available agent. We qualitatively analyze on multiple environments show RVIC more than discovered existing hierarchical reinforcement learning.
منابع مشابه
Variational Intrinsic Control
We introduce a new unsupervised reinforcement learning method for discovering the set of intrinsic options available to an agent. This set is learned by maximizing the number of different states an agent can reliably reach, as measured by the mutual information between the set of options and option termination states. To this end, we instantiate two policy gradient based algorithms, one that cr...
متن کاملRelative Information Based Distributed Control for Intrinsic Formations of Reduced Attitudes Relative Information Based Distributed Control for Intrinsic Formations of Reduced Attitudes
This dissertation concerns the formation problems for multiple reduced attitudes, which are extensively utilized in many pointing applications and under-actuated scenarios for attitude maneuvers. In contrast to most existing methodologies on formation control, the proposed method does not need to contain any formation errors in the protocol. Instead, the constructed formation is attributed to g...
متن کاملOn Variational Expressions for Quantum Relative Entropies
Distance measures between quantum states like the trace distance and the fidelity can naturally be defined by optimizing a classical distance measure over all measurement statistics that can be obtained from the respective quantum states. In contrast, Petz showed that the measured relative entropy, defined as a maximization of the Kullback-Leibler divergence over projective measurement statisti...
متن کاملRelative local variational principles for subadditive potentials
We prove two relative local variational principles of topological pressure functions P (T,F ,U , y) and P (T,F ,U|Y ) for a given factor map π, an open cover U and a subadditive sequence of real-valued continuous functions F . By proving the upper semi-continuity and affinity of the entropy maps h{·}(T,U | Y ) and h+{·}(T,U | Y ) on the space of all invariant Borel probability measures, we show...
متن کاملVariational Principle for Relative Tail Pressure
We introduce the relative tail pressure to establish a variational principle for continuous bundle random dynamical systems. We also show that the relative tail pressure is conserved by the principal extension.
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence
سال: 2021
ISSN: ['2159-5399', '2374-3468']
DOI: https://doi.org/10.1609/aaai.v35i8.16832